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Abstract. Fix a sequence c = (ci, . . . , c n ) of non-negative integers with sum n — 1. We say 
a rooted tree T has child sequence c if it is possible to order the nodes of T as Vi, . . . , v n so 
that for each 1 < i < n, Vi has exactly a children. Let T be a plane tree drawn uniformly at 
random from among all plane trees with child sequence c. In this note we prove sub- Gaussian 
tail bounds on the height (greatest depth of any node) and width (greatest number of nodes at 
any single depth) of T. These bounds are optimal up to the constant in the exponent when c 
satisfies $^™=i c i ~ 0{n); the latter can be viewed as a "finite variance" condition for the child 
sequence. 



1. Introduction 

For a positive integer n, let c = (cj)f =1 be a sequence of non-negative integers whose sum is 
n — 1 (we call such a sequence a child sequence). In this paper we consider the random plane 
tree Tc, chosen uniformly at random from the set of plane trees (rooted ordered trees) T with n 
nodes for which, for some ordering v\, . . . , v n of the nodes of T, node Vi has Cj children, for each 
i £ [n] = {1, . . . , n}. The number of such trees is 

In! 

where n k = n k (c) = #{i : q 
invariants 

|c| = 

For a given tree T and non- negative integer i, write Zi(T) for the number of nodes of T at 
distance i from the root. We then define 

tu(T) = max{Zi(T) : i G N}, h(T) = m&x{i : Z^T) / 0}, 

and call w(T) and h(T) the width and height of T, respectively. The main results of the paper 
are the following sub-Gaussian tail bounds on the width and height of T c , whose strength is 
controlled by the above invariants. 

Theorem 1. For any n > 1 and all m > 1, 

P MT C ) > m + 2} < 3e — 2 /(i472|cp) ^ p {h{%) > m} < 7e - m V(23552|c|^)_ 



k} (see e.g., [10]). For a given child sequence c, we define the 



c| | and l c 



n — 1 — rai(c) 



fate: September 22, 2010. 

2000 Mathematics Subject Classification. 60C05. 



2 



L. ADDARIO-BERRY 



Remarks. 

★ When | c | 2 = O(n), this result is best possible up to the constants in the exponents. For the 
width, this follows from a connection, explained below, between the width and the fluctuations 
of random lattice paths. For the height, consider for example the special case where n = 2m + 1 
and c consists of m twos and m + 1 zeros. Then T c is a uniformly random binary plane tree, 
and in this case our bound (and the fact that it is tight) is a well-known result of Flajolet, Gao, 
Odlyzko and Richmond [5, Theorem 1.3]. 

★ A result related to Theorem 1 appears in [1]. Fix a random variable B with E£> = 1 and 
0Var{£?} < oo. Then, for n > 1, let T n be a Galton-Watson tree with offspring distribution 
B, conditioned to have total progeny n. [1, Theorems 1.1 and 1.2] then state that, for some 
e > not depending on n, P {w(T n ) > t} < exp(— et 2 /n), and if additionally Var{B} > then 
P{/i(T n ) > t} < exp(— et 2 /n). The requirement that Var{£?} > excludes the degenerate case 
where P {B = 1} = 1. Note that if B x , . . . , B n are independent copies of B then E [X]^=i — 
n ■ Var{i?}, and so the finite variance condition would roughly correspond in our setting to a 
requirement that |c| 2 = 0(n). Now temporarily write C±, . . . , C n for the numbers of children of 
the nodes of T n (note that C±, . . . , C n are exchangeable, but are not independent — their sum is 
n—1 — and are not distributed as B). We conjecture that in fact ^ _1//2 (^r=i ^1 — Var {B}) 
has Gaussian tails. A proof of this would show that the main results of [1] can be recovered 
from Theorem 1. 

★ In forthcoming work [3], Broutin and Marckert use the tail bound for the height in Theorem 1 
as an ingredient in proving that, under suitable conditions on the child sequence, c, the tree T c 
converges in distribution to a Brownian continuum random tree after suitable rescaling. 

★ In [2], a bound very similar to the second bound of Theorem 1 was required, for the height 
of a uniformly random labelled rooted tree of a fixed size. This bound was a key step in 
establishing the existence of a distributional Gromov-Hausdorff scaling limit for the sequence 
of rescaled components of a critical Erdos-Renyi random graph G n)P when p = p(n) is in the 
critical window p — 1/n = 0(n~ 4 / 3 ). The results of this paper may thus be seen as a step 
towards establishing that the same scaling limit obtains for the sequence of components of a 
critical random graph with a given degree sequence [6, 7, 11]. This is a line of enquiry that we 
shall pursue in a future paper. 

•k The appearance of the term l c in the bound on the height is necessary. For example, the 
sequence c = (1, 1, . . . , 1, 0) corresponds to a unique rooted plane tree, of height n. (For technical 
convenience, we exclude this unique, degenerate case from consideration for the remainder of 
the paper. Note that for any other child sequence c, we have |c| > n.) More generally, given 
c, define the one-reduced sequence c*, obtained by suppressing all entries of c which are equal 
to one. If c has k entries which are equal to one, then a tree with distribution T c can then 
be generated from the tree T c * by repeatedly choosing a node v uniformly at random, then 
subdividing the edge between v and its parent (or, if v happens to be the root, then adding a 
new node above v and rerooting at this new node). Under this construction, each edge in T c * is 
subdivided k/(n — k) times on average, and this is precisely the factor encoded by l c . 

The remainder of the note is devoted to proving Theorem 1. We first briefly describe a 
family of bijective correspondences between rooted plane trees and certain lattice paths; these 
correspondences allow us to prove bounds for the height and width by studying the fluctuations 
of a certain martingale. We accomplish this bounding by using a martingale concentration result 
of McDiarmid [9], which appears as Theorem 4, below. This immediately yields the first bound 
in Theorem 1; the second requires a little further thought, and the use of a negative association 
result of Dubhashi [4]. Forthwith the details. 
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The Ulam Harris tree, breadth- first search, lex-DFS and rev-DFS. Below is a brief 
review of some basic connections between rooted plane trees and lattice paths. An excellent and 
detailed reference, with proofs, is [8]. The Ulam-Harris tree IA is the tree with root whose 
non-root nodes correspond to finite sequences of positive integers v\ . . . vt, with v\ . . . Vk having 
parent v\ . . . v^-i and children {^i . . . v^i '■ i £ {1, 2, . . .}}. For a node v of U we think of vi as 
the i'th child of v. Any rooted plane tree T in which all nodes have at most countably many 
children can be viewed as a subtree of U by sending the root of T to the root of U and using 
the ordering of children in T to recursively define an embedding of T into IA. 

Having viewed T as a subtree of IA, we now define three orderings on the nodes of T: 

(1) breadth- first search (or BFS) order lists the nodes of T in increasing order of depth, and 
for nodes of the same depth, in lexicographic order (so, for example, node 2, 3 would 
appear before 3, 1 but after 1,7); 

(2) lexicographic depth-first search ( or lex-DFS) order lists the nodes of T in lexicographic 
order; 

(3) reverse lexicographic depth-first search (or rev-DFS) is most easily described informally. 
Let T* be the mirror-image of T, and list the nodes of T in the order they (their mirror 
images) appear in a lexicographic depth- first search of T* . 

The use of rev-DFS to bound heights of trees was introduced in [1]. Each of these orders have 
the property that when a node v appears in the order, its parent in T has already appeared. 
For such orders, we may define a queue process, as follows. Given the order u±, . . . ,u n of the 
nodes of T, Let Qq = 1 and, for 1 < i < n, let Qi = Qi-i — 1 + c Ui , where c Ui is the number 
of children of Ui in T. Then Qi is the number of nodes u of the tree whose parent is among 
u±, . . . ,Ui but who are not themselves among u\ , . . . , it j . We will thus always have Qi > for 
i < n and Q n = 0. We write {Q\{T)}q =1 for the queue process on the BFS order of T, and 
likewise define {Q-(T)}" =0 and {QI (T)}q =1 for the lex-DFS and rev-DFS orders, respectively. 

Given the tree T, the preceding three processes are uniquely specified. Conversely, given any 
of the three sequences {Qf(T)}f =1 , x 6 b, l,r, the tree T can be recovered. For each x G b, l,r, 
this provides a bijection between rooted plane trees with n nodes, on the one hand, and child 
sequences (cj)f =1 with ^ 1<i<f .(cj — 1) > for all 1 < k < n. Call such sequences tree sequences. 

Given a sequence c = (cj))f =1 , set So = and Si = Si(c) = YTj=i( c j ~ 1) f° r * e M- Also, 
given a permutation a : [n] — > [n], write <r(c) for the sequence (c CT (j))™ =1 . For a given sequence 
c = (ci,...,c n ) of non-negative integers with sum n, there is a unique cyclic permutation 
a = a c : [n] — > [n] for which the sequence of partial sums <r(c) forms a tree sequence. (This 
fact yields a one-line proof of (1), above, by considering the number of permutations leaving c 
unchanged.) To be precise, a is the cyclic permutation sending k to n, where k is the least index 
at which the sequence (5 , j(c))™ =0 achieves its minimum overall value. Fix x E {b,l,r} and write 
T x {c) for the tree T corresponding to cr(c) under the x-bijection. It follows that letting r be a 
uniformly random permutation of [n], the tree T x (r(c)) is a uniformly random tree with child 
sequence c. Conversely, if T is a uniformly random tree with child sequence c, then (Qf (T))f =0 
is distributed as (Sj(<7c(C)))" =0 , where C = r(c) and r is a uniformly random permutation, 
independent of c. 

Extremes in a sequence and its permutations. In what follows, for positive integers p, q, 
we write (p) q = p mod q if q \ p and (p) q = q if q \ p. For this section, fix a child sequence 
c = (ci, . . . , c n ) and x £ {b, l,r}, and let a = a c . Note that (<Si(c))[L has Si > for all i < n 
precisely if c is a tree sequence. 

Lemma 2. J/maxo<i< n Si(<j(c)) = m then maxo<j< n |Si(c)| > m/2. 
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Proof. For x G [0, 2n], write x n = x if x < n and x n = x — n if cc > n. Since a is a cyclic shift, 
there is j so that cr(i) = (i + j) n for j G [n]. Let /c be the index at which Sk(cr(c)) = m, so that 



A' 

to 

i=l i=l 



J2( c «(i) - !) = Z]( c (i+0n - !)■ 



If i + * < n then Sj+j = Sj + to so either Sj < —m/2 or Sj+i > m/2. If j + i > n then 
= (<S* n — Sj) + Sj + i- n so either Sj < S n — m/2 = — 1 — m/2 or Sj + i_ n > m/2. □ 

Let a* be the cyclic permutation sending 1 to 1 + |_ n /2j- We then immediately have the 
following corollary. 

Corollary 3. If maxo<i< n Si(a(c)) = m then either 

max I Si I > m/4 or max l-Swiil > ml A. 

0<i< \n/2\ 0<i< fn/2] 

Proof. By Lemma 2, we have maxo<i< n |5j(c)| > m/2, so one of these two alternatives must 
occur. □ 

Martingales for the queue processes. We will use a martingale inequality that can be 
found in [9]. Let {Xi}™ =0 be a bounded martingale adapted to a filtration {J 7 ,,}^. Let V = 
Y!i=o var {X i+ i\Fi} , where 

var{X i+1 \Ti} := E [(X i+1 - Xi) 2 \Ti] = E [xf +1 \Ti] - X? 

is the predictable quadratic variation of Xi + \. Define 

t> = ess sup and b= max esssup(Xi + i — X^Ti). 

0<i<n— 1 

Then we have the following bound. 

Theorem 4 ([9], Theorem 3.15). For any t > 0, 



P < max Xi > t > < exp 



t 2 



o<i<n 4 " J " 2v(l + bt/(3v)) J ' 

In [9], this result is stated for P {X n > t} rather than for the supremum of the Xj as above. 
However, as noted by McDiarmid, the proof is based on bounding E [e n ] for suitably chosen 
h > 0. Since {e hX ',0 <i<n} is a submartingale, the version for the supremum in fact holds 
by a simple application of Doob's inequality. 

Now, fix a child sequence c = (ci, . . . , c n ) and x G {6, Z, r}. Let r : [n] — > [n] be a uniformly 
random permutation, and write C = (Ci, . . . , C n ) = (r(c)). 

For < k < n — 1 let n° k = #{i : Ci = k} = n k (c). For i > and < k < n — 1, define 

4 = 4(c) = 



4" 1 if Ci £ k, 
^l' 1 -! if Q = k. 



Then for all < i < re, ^fc=o 4 = n — *• Also, for each 1 < i < n, there is a single & with 
n k 7^ 4~ > an d furthermore, for this k, Si(C) = 5i_i(C) + — 1. Thus, for all < i < n, 



n—1 n—1 



kn\ + Si = kn° k — i = n — 1 — i. 



k=0 k=0 
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Writing Ti for the sigma- field generated by Sq, . . . , Si, we then have 

n-l 



E[S i+1 \T i ] = S i + Y,ti-l) 

3=0 



11 



J 



n — i 



Si + 1 



n — i 



and 



n-l 



E[Sf +1 \T i ]=J2(Si + (k-l)f 



k=0 



n — % 



s2 _ 25,(5, + 1) Efc=d(fc-1)H 



n — i 



n — i 



At this point it would be natural to turn to the study of the martingale whose value at time i is 
Si + X^'=o(^j + l)/( n — i)' or i n other words to subtract off the predictable part. However, this 
would require us to separately bound the sums of the (Sj + l)/(n — j), and a more direct route 
is to simply bound these summands directly. From the preceding equations, for i < n we have 



E 



Sj+i + 1 
(i + l) 



1^ 



7? 



5i + l 



5 4 + l 



5i + l 



n — (z + 1) (rt — i)(n — (i + 1)) n — i 

Here we take 0/0 = 1 by convention to deal with the term i = n — 1. Thus, Mj = (5j + l)/(n — i 
is an J^-martingale. Since 5j+i > Si — 1 for each i < ti, for i < [n/2\ we have 

2 5, + 1 5i + 1 



At 



i+l 



Si+i + 1 > Si + 1 



2 > 5 t + l 4 



n — (i + 1) n — (i + 1) n n — i (n — i)(n — (i + 1)) n n — i n 
which we will use below when applying Theorem 4. We also have 



Mi—, 



var 



{M i+l \Fi} = E[Mf +l \Fi 



Mi 



— 2 (e [5^|j;]+2E[5, +1 |j-,] + r 



n- (i + l) 
1 

(n-(i + l)Y 



< 



(n-(i + l)Y 



3 + 



3 + 



i^i=i c 



n — i 



n — i 

2 • 



n — i 
5. + 1 



(n — i)(n — (i + 1)) 



Writing a = Yl7=i c l/ n -> f° r * < L n /2j we obtain the bound 

4(3 + 2a) 



w{M m |J-;} < 



n- 



and so 



Ln/2j 



£ wor {Mi |7i_i} < 



4(3 + 2a) 



i=l 



7? 



It follows by applying Theorem 4 to {— Mj}j-™Q that for all t > 



P <! min Si<-(t + l)> < P < mm 

,0<i<|ra/2j J [0<i<Ln/2j 71 — 7 



< exp 



5i + l rj 
— — < — 

n 



exp 



n ■ 8(3 + 2o)(l + t/(3(3 + 2a)n)) 



8(3 + 2a)n + 8t/3J ' 



(2) 
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Recall that o~ c is the unique cyclic permutation a which makes o~(c) a tree sequence. We are 
now prepared for our principal bound on the fluctuations of {Si(a(c)), < i < n}. 



Theorem 5. For any non-negative integer m, 

P < max Si(a c (c)) > m + 2 > < 3exp 



2 

m 



^0<i<n J \ 368|c| 2 

Proof. First, since cr c (c) = <7c(C), it suffices to bound P {maxo<j< n S^o^C)) > m + 2}, which 
is what we shall do. Also, for m > n — 3 the event under consideration can never occur, so 
we may and shall assume m < n — 3. Finally, for this proof, by Si we mean Si(C) unless an 
argument is provided. 

First note that if maxo<j< n 5j(<r(C)) = m + 2, then 

max Si — min Si = m + 3. (3) 

0<i<n 0<i<n 

(In fact, the same must hold for any cyclic permutation of C.) This will imply that at some 
point, {Si,0 < i < n} drops in value significantly. Let tjiq = max <j<L n /2j Si, and consider the 
following two events. 

(a) mino^Ln/2] Si <-(m + 3)/3 

(b) S , [n/2J <m -(m + 3)/3. 

If (a) does not occur then {So, Si, ... , <S'Ln/2j } C (— (m + 3)/3, mo). Thus, if neither (a) nor (b) 
occur then for (3) to hold one of the following must take place. 

(c) m > 2(m + l)/3, 

(d) max|_ n /2j<j<„5j > 2(m + 3)/3, 

(e) min Ln/2 j<i< n Si < m - (m + 3). 

If (b) does not occur but (c) occurs then S n — S^ n / 2 \ < — (jn + 3)/3. If (d) occurs then since 
S n = -1, S n - max Ln/2 j <i < n Si < -2(m + 3)/3. 

Now note that if either (a) or (b) occurs then 

min (5i n / 2 j - S\ n / 2 \-i) < -(m + 3)/3, 

0<i< [n/2] 

and if (b) does not occur but one of (c),(d) does then S n — max^ n / 2 j < j< n Si < — (m + 3)/3, and 
so 

min (S n -S n -i) < -(m + 3)/3. 

0<i<[n/2] 

Finally, if (b) does not occur but (e) occurs then since 5*^/2] > tuq — (m + 3)/3, we have 

min (Si - Si n/2 1 ) < -2(m + 3)/3. 

\n/2\<i<n 1/1 

Since (S\ n /2\ — S\ n /2\-i,0 <i< \n/2\) has the same distribution as (Si,0 < i < [n/2\), and 
(Si — S\ n /2\, [n/2\ < i < n) and (S n — S n -i,0 < i < \n/2~\) both have the same distribution as 
(Si,0<i< \n/2]), it follows that 



P <^ max Si(a c (C)) >m + 2}<3P{ min Si < -(m/3 + 1) 

\0<i<n J lo<i<Ln/2j 



< 3exp 



2 

m 



72(3 + 2a)n + 8m J ' 



the latter bound holding by (2). Since m < n — 3, we have 72 • 3n + 8m < 224n < 224|c| 2 . Also, 
72 • 2an = 144|c| 2 , and the result follows. □ 
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Bounding the width and the height. The bounds of Theorem 1 follow straightforwardly 
from Theorem 5. Let T c be a uniformly random tree with child sequence c. As noted earlier, 
(Qi(T c ))i =0 is distributed as (Si(a c (C)))f =0 , where C = r(c) and r is a uniformly random 
permutation, independent of c. Furthermore, when the breadth-first exploration has just finished 
exploring all the nodes at depth k, the queue length is precisely the number of nodes at depth 
k — 1. It follows by Theorem 5 that 



P {w{%) >m + 2}<P\ max Q\{%) > m + 2 \ < 3 exp 



m 



o<i<n iV c > - j - * \ 368|c| 2 

proving the bound for the width. (Also, if at some point the queue length is at least m then 
w(7~c) > m/2, from which the optimality of the with bound when |c| 2 = 0(n) follows straight- 
forwardly.) 

In bounding the height, we assume that m > or else the bound follows trivially since 

| c | 2 > n. First suppose that c is one-reduced (so has no entries equal to one). For any node 
u € T, let X(u) (resp. piu)) be the index of u when the nodes of T are listed in lex-DFS order 
(resp. rev-DFS order). Since c is one-reduced, each ancestor of u in T has at least one child 
that is not an ancestor of tt, and so either Q l \( u \(T) or Q r p ( u \(T) is at least \u\/2. It follows that 
when c is one-reduced, 

P{h(T c ) > m + 4} < P | max Q\{T C ) > \m/2\ + 2I + P j max Q\(%) > \m/2] + 2I 



< 6 exp 



proving the bound in this case. 



2 

m 



1472|cl 2 



More generally, write c* for the one-reduced version of c, obtained from c by removing all 
entries that are equal to one, and let n* be the length (number of elements) of c*. Also, write 
T* for the tree obtained from T c by replacing each maximal path whose internal nodes all 
have exactly one child, by a single edge. List the edges of T* according to some fixed rule as 
(e±, . . . , e n *-\). Note that we always have n* < n — 1. Each edge ei corresponds to some path 
in T c , and we write for the number of internal nodes of this path (i.e. the total number of 
nodes, minus two). Then T* is distributed as a uniformly random tree with child sequence c*, 
and, independently of 7~*, (si, . . . , s n *_i) is a uniformly random element of the set of vectors of 
non- negative integers of length n* — 1 with sum n — n* — 1. From Theorem 5 we thus have 



2 \ / 2 

m \ I m 



P{h(T*) >m + 4}< 6exp [~j^f ) ^ 6eXp [~ 147^2 ) ■ ( 4 ) 

If n* > n — -y/n then h(T c ) < h(T*) + \/n, and in this case the required bound follows (recall 
that we have shown we may assume m > 6^/n). In what follows we thus assume n* < n — i/n. 

By Proposition 5 of [4], the entries of (s\, . . . , s n *_i) are negatively correlated and thus stan- 
dard Chernoff bounds apply to any restricted sum of elements of (s±, . . . , s n *_i). In particular, 
for any node v of T* , 

J v = {i : ei is an edge of the path from v to the root of T*}. 

We always have J v < h(T*) — 1, and thus by a Chernoff bound (e.g., [9], Theorem 2.2), 

n — n* — 1 



P<S Jv > (l + x)(m + 2) 



n* — 1 



? < exp ( - 



h(T*) < m + 3 } < exp -2x\m + 2)' 



n* — 1 



8 



L. ADDARIO-BERRY 



To get a clean final bound, we choose x so that (1 + x)(m + 2) = 2m. It then follows by a union 
bound that 

pLveT*: S Jv >2m n ' 



n* — 1 



h(T*) < m + 3 



< exp I log(n* — 1) — 2(m — 2^ 



, n — n* — 1 
n* — 1 



< exp I — (m — 2) 



,n — n* — 1 



< exp 



n* - 1 
m 2 (n — n* — 1) 
9|cP 



the second inequality holding since (m—2) 2 > n* — 1 andn— n* — 1 > y^n— 1 > log(n* — 1), and the 
third holding since |c| 2 > n > n* — 1 and (m—2) > m/3. Since m+4 < 2m = 2m{n* — l)/(n* — 1), 
it then follows from (4) that 



P ^ ft(7^) > 4 ^ 



n - 2 
n* — 1 



< 7 exp 



m 



1472|c| 



But (n — 2)/(n* — 1) = l c , and the result follows. 
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